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We develop a simple and intuitive identity for calculating expectations 
of weighted fc-fold sums over particles in branching processes, generalising 
the well-known many-to-one lemma. 

1 Introduction 

1.1 The many-to-two lemma 

Consider a branching Brownian motion: one particle starts at and moves like 
a Brownian motion until an exponentially distributed time with mean 1. It 
then dies and leaves in its place two new particles, which independently follow, 
relative to their initial position, the same random behaviour as their parent. Let 
N{t) be the set of particles alive at time t, and for a particle u G N{t) let Xu{t) 
be the position of particle u. Let -Bj, t > be a standard Brownian motion, and 
/ : M ^ M a measurable function. The following result is well-known (see [3]): 

Lemma 1 (Simple many-to-one lemma). 



This lemma turns questions about a system of many dependent particles into 
questions about a single Brownian motion. For example, let A{x, t) = #{u G 
N{t) : Xu{t) > x}, the number of particles that are above x at time t. For 
which X and t is A{x,t) non-zero? (This question is related to solutions of the 
FKPP equation.) Markov's inequality and the many-to-one lemma give us an 
easy upper bound: 
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For a lower bound, one would like to use a second-moment method, applying 

To calculate E[yl(a;,t)^] we should use a many-to-two lemma. 
Lemma 2 (Simple many-to-two lemma). For measurable f and g, 
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with T exponentially distributed with parameter 2 and Wt, t > a standard 
Brownian motion independent of Bt . 

The main result of this article will be the many-to-few lemma. Lemma |3j 
which is a much more general version of Lemma [2j In fact we will be able to 
calculate sums over arbitrarily many particles. We also incorporate the pos- 
sibility of using a change of measure for the motion of the particles which in 
applications often results in easier calculations. 

We prove our main result, Lemma [3] by developing a theory of multiple 
spines. It was not at all obvious that such a theory would even exist, let alone 
how to construct it so that it would lead to a clean and general many-to-few 
result. The single spine was introduced by Chauvin and Rouault [5], developed 
by Lyons et al. and has since proved very useful in many situations in 

branching processes. 

In special cases, results similar to Lemma [3] have existed for some time in 
various formtj^ usually proved by arguments specific to the particular model 
or problem and without any underlying spine framework. Our article provides 
several advantages over these previous results. We state Lemma [3] for a general 
model, and the spine setup gives an intuitive backdrop allowing one to transfer 
the result to other processes. Also, to our knowledge there is no existing work 
that incorporates changes of measure. 

There are already several applications of this theory underway. Roberts [T^] 
uses the full power of our general many-to-two lemma to give simple proofs of 
large-time asymptotics on the log scale for branching Brownian motion. Ai'dekon 
and Harris the fc-particle version to compute moments in order to show 

that the number of particles hitting a certain level in a branching Brownian mo- 
tion with killing at the origin converges in distribution in the limit approaching 
criticality. Doring and Roberts [7] calculate moments of numbers of particles 



^An even simpler form of LemmajSjwas given by Sawyer |14|. Kallenberg 10 proved a ver- 
sion for discrete trees, which he calls a "backward tree formula" . Gorostiza and Wakolbinger 
[5] extend Kallenberg's formula to a class of continuous-time processes. Dawson and Perkins 
generate what they call "extended Palm formulas" for historical processes (superprocesses en- 
riched with information on genealogy) in [6] . For the parabolic Anderson model with WeibuU 
upper tails, Albeverio et al. [2| gave a similar result by considering existence and uniqueness of 
solutions to a Cauchy problem. Bansaye et al. [1] develop quite general many-to-two lemmas 
for Markov branching processes, allowing particles to be born away from their parent. 
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in a catalytic branching model, for which the multiple spine theory gives an 
intuitive combinatorial derivation for a collection of constants which otherwise 
appear abstractly from the analysis. Ortgiese and Roberts (work in progress) 
also apply the A:-particle version to the parabolic Anderson model to show that 
the large-time behaviour of the underlying branching process is rather different 
from that anticipated by its moments. 

The article is arranged as follows. In Section [2] we give a summary of the multi- 
spine setup, and then state our main result in Section [3j In Section |4] we give 
full constructions of the measures and filtrations used in the theory, and then 
prove the many-to-few lemma in Section [Sj Finally, in Section [6] we state a 
discrete-time version of the many-to-few lemma. 

2 Multiple spines 

We consider a branching process starting with one particle at x under a probabil- 
ity measure P^;. This particle moves within a measurable space (J, B) according 
to a Markov process with generator C. When at position y, a particle branches 
at rate R{y) , dying and giving birth to a random number of new particles with 
distribution (which has support on {0, 1,2,.. .}). Each of these particles then 
independently repeats the stochastic behaviour of its parent from its starting 
point. 

We denote by N{t) the set of all particles alive at time t. For a particle 
u e N{t) we let cr„ be the time of its birth and t„ the time of its death, and 
define au{t) — au A t and r„(t) = r„ At. If u € N{t) then for s < t wc write 
X„(s) for the position of the unique ancestor of u alive at time s. If u has 
children then we write Xu{s) — A for all t > Tu, where A ^ J is a graveyard 
state. 

2.1 The fc-spine measures and 

We define new measures and under which there are k distinguished lines 
of descent which we call spines. Under P^ particles behave as follows: 

• We begin with one particle at position x which (as well as its position) 
carries k marks 1, 2, . . . , fc. 

• All particles move as Markov processes with generator C, independently 
of each other given their birth times and positions, just as under P^;. 

• We think of each of the marks 1, . . . , /c as distinguishing a particular line 
of descent or "spine" , and define £,1 to be the position of whichever particle 
carries mark i at time t. 

• A particle at position y carrying j marks bi < b2 < ■ ■ ■ < bj at time t 
branches at rate R{y), dying and being replaced by a random number of 
particles with law fiy independently of the rest of the system, just as under 
P.. 

• Given that a particles vi, . . . ,Va are born at a branching event as above, 
the j spines each choose a particle to follow independently and uniformly 
at random from amongst the a available. Thus for each 1 < I < a and 
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1 ^ * ^ j the probability that vi carries mark i just after the branching 
event is 1/a, independently of all other marks. 

• If a particle carrying j > marks 61 < 62 < • ■ • < &j dies and is replaced 
by particles, then its marks remain with it as it moves to the graveyard 
state A. 

In other words, under the system behaves exactly as under Pa,; the only 
difference is that some particles carry extra marks showing the lines of descent 
of k spines. We call the collection of particles that have carried at least one 
spine up to time t the skeleton at time t, and write skel(t); see Figure [T] Of 
course P^ is not defined on the same cr-algebra as P^: . We let be the filtration 
containing all information about the system (including the k spines) up to time 
t; then Pj; is defined on J^^. This will be clarified in Section [4] 




Figure 1: Each particle in the skeleton is a different colour, and particles not 
in the skeleton are drawn in grey. The numbers show how many spines are 

carried by each particle. 



Now, for each n > and y £ M let 

m"(y) 



the nth moment of the offspring distribution. Let 



fj,y is called the nth size-biased distribution with respect to fiy. For 1 < i, j < k 
define T{i,j) to be the first split time of the ith and jth spines, i.e. the first 
time at which marks i and j are carried by different particles. Let D{v) be the 
total number of marks carried by particle v. 

Suppose that C{X,t) is a functional of a process {Xt,t > 0) such that if 
{Xt, i > 0) is a Markov process with generator B then C(-^: t) is a non-negative 
unit-mean martingale with respect to the natural filtration of (Xt ,t > 0). For 



example if X is a Brownian motion on . 
Under particles behave as follows: 



then we might take C(^i t) = e 



Xt-t/2 



• We begin with one particle at position x which (as well as its position) 
carries k marks 1,2, ... ,k. 
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• Just as under P^, we think of each of the marks 1, . . . , fc as a spine, with 
Q the position of whichever particle carries mark i at time t. 

• A particle with mark i at time t moves as if under the changed measure 

'3^U(^^:,s<t) := C{C,t)Px\criii,s<t)- 

• A particle at position y carrying j marks at time t branches at rate 
m^{y)R{y), dying and being replaced by a random number of particles 
with law yU^ independently of the rest of the system. 

• Given that a particles vi, . . . ,Va are born at such a branching event, the 
J spines each choose a particle to follow independently and uniformly at 
random, just as under P^. 

• Particles not in the skeleton (those carrying no marks) behave just as 
under P, branching at rate R{y) and giving birth to numbers of particles 
with law fly when at y. 

In other words, under spine particles move as if weighted by the martingale 
they breed at an accelerated rate; and they give birth to size-biased numbers 
of children. 



3 The many-to-few lemma 

We note here that if Y is measurable with respect to J-^ , then it can be expressed 
as the sum 

vi,...,v^eN{t)u{A} 

where each Y(vi, . . . , v^) is J-"t-measurable. To see this one can generalize the 
argument on pages 24-25 of |13| . Since this is a purely measure-theoretic argu- 
ment and will be clear for most Y of interest, we leave it as an exercise for the 
reader. We now state our main result in full. 

Lemma 3 (Many-to-few). For any k > 1 and -measurable Y as above, 

y(wi, . . . , Vk)l{Q{v,.t)>0 Vi=l,...,/c} 

vi,...,VkeN{t) 

Note that under Q^, all particles v G skel(i) have Ci^^jt) > for all t and 
hence under P^^ only those particles u with C{u,t) > can possibly be counted. 
This is why the indicator function appears in the P^j-expectation on the left- 
hand side. In applications, if we wish to introduce a model incorporating killing 
of particles, this can be achieved by allowing ^ to become zero so that we only 
count those particles still alive at time t. 



4 Multiple spines and changes of measure 

Our main aim in this section is to give full details of the setup introduced in 
Section O 
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4.1 Trees 



We use the Ulam-Harris labelling system: define a set of labels 

fl := {0}u y N". 

We often call the elements of J7 particles. Wc think of as our inital ancestor, 
and a label (3, 2, 7) for example as representing the seventh child of the second 
child of the third child of the initial ancestor. For a particle u G fl we define 
|u|, the generation of u, to be the length of u (so if m € N" then \u\ = n, and 
|0| = 0). For two labels u,v £ Q we write uv for the concatenation of u and v, 
taking %u = u$ = u. We write u<v and say that u is an ancestor of v if there 
exists w; G such that uw = v. 

Wc define T to be the set of all trees: subsets r C Q such that 

• e r: the initial ancestor is part of r; 

• for all u,v G fl, uv G T u G t: if t contains a particle then it contains 
all the ancestors of that particle; 

• for each uGt, there exists A„ G {0, 1,2,.. .} such that for j G N, uj e r if 
and only if 1 < j < A„: each particle in r has a finite number of children. 

4.2 Marked trees 

Since wc wish to have a particular view of trees, as systems evolving in time 
and space, we define a marked tree to be a set T of triples of the form (it, /„, X„) 
such that uGfl, the set 

tree(r) := {u : 3 such that (m, X^) G T} 

forms a tree, lu G [0,oo) is the lifetime of u, and, setting cr„ := J2v<u^v and 

Xu ■ [o-„,T„) J 

is the position function of u. We think of the inital ancestor moving around 
in space according to its position function X^ until time Zg. It then disappears 
and a number of new particles appear; each moves according to its position 
function for a period of time equal to its lifetime, before being replaced by a 
number of new particles; and so on. 

We let T be the set of all marked trees, and for T e T we define 

N{t) := {u G tree(r) : ct„ < t < r„}, 

the set of particles alive at time t. For convenience, wc extend the position path 
of a particle v to all times t E [0,t„), to include the paths of all its ancestors: 

X (t) ■= / ^""^^^ iiay<t<Ty 
""^ ^ ' \ Xu{t) \iu<v and a-„ < t < t„ 

and \t Ay = Q then we write Xy{t) = A > r^. 
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4.3 Marked trees with spines 

We now enlarge our state space further to include the notion of spines. A spine 
^ on a marked tree r is a subset of tree(T) such that 

• ^ n {N{t) U {A}) contains exactly one particle for each t; 

• if w G ^ and u < v then u £ ^; 

• if f e ^ and > 0, then 3j e {!,..., Ay} such that vj E ^; otherwise 

^nN{t) = vt > Ty. 

If w e ^ n N{t) then we define ■= Xy{t), the position of the spine at time t. 
Sometimes we shall use the notation to mean the particle v itself — beyond 
this introduction it should always be clear from the context which meaning is 
intended. For clarity within this section we will use the notation node(^t) to 
denote the particle v itself — that is, the unique v G N{t) n We say that a 
marked tree with spines is a sequence (r, ■ ■ ■) where r G T is a marked 

tree and ... are spines on r. We let T be the set of all marked trees with 

spines. 

4.4 Filtrations 

We now work exclusively on the space T of marked trees with spines, and use dif- 
ferent filtrations on this space to encapsulate different amounts of information. 
We give descriptions of these filtrations below; formal definitions are similar to 
those in [13] and are left to the reader. 

The filtration {J^t,t > 0): We define {Tt,t > 0) to be the natural filtration of 
the branching process - it does not know anything about the spines. 

The filtrations {Tf,t > 0): For each A; > 1 we let {Tt,t > 0) be the natural 
filtration for the branching process and the first k spines. It does not know 
anything about spines ■ ■ ■ ■ 

The filtrations {Gi,t > 0): For each j we define Qf := a {£,i,s G [0,t]), where 
represents the position of the jth spine at time s. contains just the spatial 
information about the jth spine up to time t (and whether or not it has died), 
but does not know which nodes of the tree actually make up that spine. 

The filtrations {gl'''---'''\t> 0): For each j-tuple h,.. we define 
where 

A'l = {{u = node(e,^)} : u G 17, s G [0, t]} 

and 

{{u < nodc(^j''), A„ a, CT„ < o-} : u G il, a > 2, o- G [0, oo)}. 

g{n,---:i]} contains all the information about the relevant collection of spines up 
to time t: which nodes make up the spines, their positions, and for all spine 
nodes not in N{t) (so all the strict ancestors of the spines at time t) their 
lifetimes and number of children. 
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The filtration {g^,t > 0): We use the shorthand = g{^- -M ^ ^^lat 
knows everything about the first k spines up to time t. Thus Qi is different 
from qI'^^ . 



4.5 Probability measures 

We may now take a probability measure P^; on T such that under P^,, the 
system evolves as a branching process starting with one particle at a;, each 
particle moves as a Markov process with generator C independently of all others 
given its birth time and position, and a particle at position y branches at rate 
R{y) into a random number of particles with distribution /ij,. This is the system 
described in Section [2] This measure, however, has no knowledge of the spines 
(since it sees only the filtration Ft). We would like to extend this to a measure 
on each of the finer filtrations . To do this, we imagine each spine, at each 
fission event, choosing uniformly from the available children. Then it is easy to 
see that, for any particle u in a marked tree T and any j > 1, we would like 



Prob(uee) = n T 



Ay 

We recall from Section [i] that if Y is an ^^'^-measurable random variable then 
we can write: 

vi,...,Vk&N{t)VJ{A} 

where each is J^t-measurable. (Here when we write we are 

talking really about the particle node(^f ) rather than its position.) 

Definition 4. We define the probability measure P^ on (T, J^), by setting 



fl[Y]=f>^ 



Di,....-UfcGAf(t)U{A} j = lu<v, " 



(4) 



for each J^j'^ -measurable Y with representation (|3|. Note that 



jfci 



1.7^=. 



In summary, particles carrying spines behave just as they would under P^., 
and when such a particle branches, each spine makes an independent choice 
uniformly from amongst the available children. 



4.6 Martingales and a change of measure 

As in Section [2] define T{i,j) :— mi{t > Q £,{}, and suppose that we 
are given a functional ({-^t), t > 0, such that Ci^^t) is a non-negative unit- 
mean martingale with respect to the natural filtration of the Markov process 
{Yt,t > 0) with generator C. We call ( the single-particle martingale. 

Recall that we defined skel'^'(t), the skeleton, to be the subtree up to time t 
generated by those particles carrying at least one of the k spines, 

skel''(i) ^ {u e -.Bs <t,j < k such that node(^^') = u}. 
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We also set 

D'"{v) = #{i <k:3t with v^^{} 
to be the number of spines foUowing particle v, and define 

E''{v,t) = exp J ^ (m^'^''\Xy{s)) - ij i?(X„(s))(isj . 

Since we will not always know which particles are the spines (when we are 
working on for example), it will sometimes be helpful to have the above 
concepts defined for a general skeleton of k particles Ui,. . . ,Uk instead of the 
spines. For this reason we define 

skel„i,..,,„fc(f) = {v €: Vt : (jy < t,3j with v < Uj}, 

Dui,...,uk{'v) = #{j : V < Uj}, 

and 

Eu„...,uAv,t) = exp (m^"i. - l) R{X,{s))ds^ 

so that 

skel*^(t) = skelji^ D''{v) = D^i^^_^^k^{v) and E''{v,t) := E^i ^,,,^^k^{v,t). 

Remark. We note that, with the notation given above, 

feskel„i,...,„s^(t)\JV(t) 

Definition 5. We define an .Fj'^-adapted (and, in fact, ^(''-adapted) process 
CHt), i > by 



t?€:skeP(t) t?eskeP(t)\A/(t) 

(if Ay = then we define ({Xy,Ty{t)) = 0) and an J^t-adapted process Z''{t), 
i > by 

Z^{t)= 51 l{C(«i,«)>o Vi=i,...,fc} n ' a \t)) -^"1 • • '"fc • 

ui,...,Uk^N(t) wGskel„i,...,„^(t) 

We remark here that and C,{^'' i ) iii fact, simply the projections of 
C,^ onto the relevant filtrations: 

Z\t) = n[C\t)\Tt] and C(e,i) = n[C\t)\gW 
Lemma 6. The process (''{t), t > is a martingale with respect to the filtrations 
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Proof. Let x = (wi, W2, • ■ •) be a single line of descent (so in particular vi < V2 < 
. . .), with xt representing the position of the unique Vi that is alive at time t. 
The births along x form a Cox process driven by Xt with rate function R. Thus 
for any j > 0, 



{m^{Xs)~l)R{Xs)d^ 



We work by induction on k. The case fc = 1 is just the single spine case, and is 
proved by conditioning first on Q^, since the births along the spine form a Cox 
process driven by with rate function R. Then, by induction, it is enough to 
consider the process up to the first split time of the skeleton, since after this 
time no particle carries more than A; — 1 spines. But up to the first split we have 
a single particle carrying k spines, so the same argument holds as for the single 
spine case: the births again form a Cox process driven by with rate function 
R. □ 



Definition 7. We define the measure by setting 



Cit). 



The proof that behaves as claimed in Section 2.1 is identical to the proof 
for one spine given by Chauvin and Rouault [5], applied to each branch of the 
skeleton independently. 



5 Proof of the many-to-few lemma 

We first calculate the probability that particles (ui, . . . , m^) make up the skeleton 
at time t. 

Lemma 8 (Gibbs-Boltzmann weights for Q'^). For any ui, . . .Uk G N{t) U {A}, 

1 



Proof. By the fact that Pj;[C(t)|J-t] = Z{t) and standard properties of condi- 
tional expectation. 



1 / T-r ax,,Mt)) 



n 



....^.,At)\N(t) ) 



. feskoi„j,...,„jt)\7V(i) 
1 T-r C(^„T„(t)) 



n 
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The proof of the niany-to-few lemma is now straightforward. 
Proof of Lemma We begin with the right-hand side. 



y n 



C(X,,T„(t)) Eiv,t) 



C(X„,a„(t)) 



«i,...,«fceiV(t)u{A} 



ueskcluj....,„j, (t) 



5] Y{m,...,uk) n 



ni,...,«fc6Ar(t)U{A} 



ueskcluj,...,„j.(t) 



C(X., Q^fei ^ z,!, . . . ^ UklJ't) ' 

{v,t) 



ui, ...,ukeN{t) 



Ij- Y{ui,...,uk)i{au^. 



t)>0 Vi=l,...,k} 



«i,...,«fcGW(t) 



«i,...,«fceAf(t) 



where for the last step we used the fact that 



U 



6 Many-to-few in discrete time 

We state here a version of the many-to-few lemma for discrete-time processes. 
We shall not prove it, as it is very similar to the continuous-time version studied 
above. 

We begin with one particle in generation located at a; G J. Any particle 
at position y has children whose number and positions are decided according to 
a finite point process "Dy on J. The children of particles in generation n make 
up generation n + 1. We define N{n) to be the total number of particles in 
generation n, and Xy to be the position of particle v. We set m^{y) — Vy[N{\y] 
to be the jth moment of the number of particles created by the point process 
Vy. Write \v\ to be the generation of particle v. For a particle v in generation 
n > 1, let p{v) be its parent in generation ri — 1. For any line of descent 
vo, vi, V2, ■ ■ ■ such that |w„| = n and p{vn+i) = Vn for each n > 0, we note that 
■ ■ is a Markov chain with some generator C not depending on 
the choice of fg, Wi, . . .. Suppose that C(^i n), n > is a functional of a process 
{Xn,n > 0) such that if (X„,n > 0) is a Markov process with generator C 
then C(X, n), n > is a martingale with respect to the natural filtration of 
(X„,n>0). 



6.1 The measure Q^. and the main result in discrete time 

We have k distinguished lines of descent just as in the continuous-time case, 
which we call spines. Under P, if a particle carrying j marks (i.e. the particle is 
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part of j spines) in generation n has I children in generation n + 1, then each of 
its j marks chooses a particle to follow in generation n+1 uniformly at random 
from the I children. We let be the position of the ith spine in generation n 
and define skel(n) to be the set of all particles of generation at most n which 
are part of at least one spine. Set Dy to be the number of marks carried by 
particle v. 

Under particles behave as follows: 

• A particle at position y carrying j marks has children whose number and 
positions are decided by a point process such that: 

- for each j and I > 0, Q^(Af(l) = = l^fy{N{l) = l)/Fy[N{iy]; 

— for each i, the sequence X^i,X^i^,X^.^, . . . is a Markov chain dis- 
tributed as if under the changed measure (5^|g{«} := C{Cn)F^\g{i}. 

• Given that a particles vi,. . . ,Va are born at such a branching event, the 
j spines each choose a particle to follow independently and uniformly at 

random, just as under Pj;. 

• Particles not in the skeleton (those carrying no marks) have children ac- 
cording to the point process Vy when at position y, just as under P. 

In other words, under spine particles move as if weighted by the martingale 
C, and they give birth to size-biased numbers of children. 

Lemma 9 (Many-to-few in discrete time). For any k > 1 and J^!^ -measurable 
Y such that 

^= Yl ^(^i'---'^fc)l{a=^i,...,£S=i'*>} 

vi,...,VkeN{n)U{A} 

we have 



Y{Vl, l'fc)l{C(wj,n)>0 Vi=l,...,fe} 

Vi,...,Vk&N{n) 



y n 



C(p(t;),|^|-1) n. 



j;eskel(n)\{0} 



CKH) 



TO^^(-)(Xp(,)) 
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